Zixuan Yang

MERIT: Matching Expertise via Rubric-Informed Training for Reviewer Assignment

May 27, 2026
Via arXiv

Tournament-GRPO: Group-Wise Tournament Rewards for Reinforcement Learning in Open-Ended Long-Form Generation

May 26, 2026
Via arXiv

JADE: Bridging the Strategic-Operational Gap in Dynamic Agentic RAG

Jan 29, 2026
Via arXiv

RATE: Reviewer Profiling and Annotation-free Training for Expertise Ranking in Peer Review Systems

Jan 27, 2026
Via arXiv

Beyond Monolithic Architectures: A Multi-Agent Search and Knowledge Optimization Framework for Agentic Search

Jan 08, 2026
Via arXiv

Exploring Human-Like Thinking in Search Simulations with Large Language Models

Apr 10, 2025
Via arXiv

Learning Cascade Ranking as One Network

Mar 12, 2025
Via arXiv

Adaptive$^2$: Adaptive Domain Mining for Fine-grained Domain Adaptation Modeling

Dec 11, 2024
Via arXiv

Scaling Laws for Online Advertisement Retrieval

Nov 20, 2024
Via arXiv

Reinfier and Reintrainer: Verification and Interpretation-Driven Safe Deep Reinforcement Learning Frameworks

Oct 19, 2024
Via arXiv